
    Using Comparable Corpora to Augment Statistical Machine Translation Models in Low Resource Settings

    Previously, statistical machine translation (SMT) models have been estimated from parallel corpora, or pairs of translated sentences. In this thesis, we directly incorporate comparable corpora into the estimation of end-to-end SMT models. In contrast to parallel corpora, comparable corpora are pairs of monolingual corpora that have some cross-lingual similarities, for example topic or publication date, but that do not necessarily contain any direct translations. Comparable corpora are more readily available in large quantities than parallel corpora, which require significant human effort to compile. We use comparable corpora to estimate machine translation model parameters and show that doing so improves performance in settings where a limited amount of parallel data is available for training. The major contributions of this thesis are the following:

    * We release ‘language packs’ for 151 human languages, which include bilingual dictionaries, comparable corpora of Wikipedia document pairs, comparable corpora of time-stamped news text that we harvested from the web, and, for non-Roman script languages, dictionaries of name pairs, which are likely to be transliterations.
    * We present a novel technique for using a small number of example word translations to learn a supervised model for bilingual lexicon induction which takes advantage of a wide variety of signals of translation equivalence that can be estimated over comparable corpora.
    * We show that using comparable corpora to induce new translations and estimate new phrase table feature functions improves end-to-end statistical machine translation performance for low resource language pairs as well as domains.
    * We present a novel algorithm for composing multiword phrase translations from multiple unigram translations and then use comparable corpora to prune the large space of hypothesis translations. We show that these induced phrase translations improve machine translation performance beyond that of component unigrams.

    This thesis focuses on critical low resource machine translation settings, where insufficient parallel corpora exist for training statistical models. We experiment with both low resource language pairs and low resource domains of text. We present results from our novel error analysis methodology, which show that most translation errors in low resource settings are due to unseen source language words and phrases and unseen target language translations. We also find room for fixing errors due to how different translations are weighted, or scored, in the models. We target both error types; we use comparable corpora to induce new word and phrase translations and estimate novel translation feature scores. Our experiments show that augmenting baseline SMT systems with new translations and features estimated over comparable corpora improves translation performance significantly. Additionally, our techniques expand the applicability of statistical machine translation to those language pairs for which zero parallel text is available.

    Natural Language Processing and Temporal Information Extraction in Emergency Department Triage Notes

    Electronic patient records, including the Emergency Department (ED) Triage Note (TN), provide a rich source of textual information. Processing clinical texts to create important pieces of structured information will be useful to clinicians treating patients, clinicians in training, and researchers and practitioners in biosurveillance. This work applies natural language processing (NLP) and information extraction (IE) techniques to the TN genre of text. In particular, it presents the Triage Note Temporal Information Extraction System (TN-TIES), which combines a shallow parser, machine-learned classifiers, and handwritten rules to identify, extract, and interpret temporal information in TNs in preparation for the automatic creation of a timeline of events leading up to a patient's visit to the ED. The success of TN-TIES suggests that NLP and IE techniques are appropriate for the genre and that the automatic production of a timeline of TN events is a realistic application.

    Uncovering phantom shocks in cardiac patients with an implantable cardioverter defibrillator

    Background: Implantable cardioverter defibrillator recipients sometimes report “phantom shocks” (PSs), defined as a reported shock lacking objective evidence. The aim of this study was to describe the subjective experience of PSs and their psychosocial correlates using a mixed methods approach. Methods: PS participants were matched on sex and age with individuals who received objective shocks only (OSO). Participants were interviewed and completed measures of posttraumatic stress disorder (PTSD Checklist—Civilian Version), depression and anxiety (Hospital Anxiety and Depression Scale), disease-specific distress (Cardiac Anxiety Questionnaire—CAQ), and social desirability (Socially Desirable Response Set—SDRS). Interviews were analyzed using interpretative phenomenological analysis (IPA). Results: Seventeen male patients participated (PS: n = 9; OSO: n = 8). Three themes emerged from IPA: (1) PS as a somatic experience, (2) the emotional impact of PSs, and (3) searching for meaning. Quantitative analyses showed that both groups exhibited elevated trauma and anxiety levels. Effect size differences (ESD) suggested a medium ESD on depression (P = 0.176, ηp² = 0.118) and PTSD (avoidance: P = 0.383, ηp² = 0.055, numbing: P = 0.311, ηp² = 0.068), and a large ESD on SDRS (P = 0.081, ηp² = 0.189), where PS participants, comparatively, exhibited elevated levels. A medium ESD was detected on CAQ-fear (P = 0.237, ηp² = 0.092) where OSO participants exhibited greater heart-focused worry. Conclusion: The qualitative and quantitative findings of this mixed method study show convergence in terms of the emotional factors associated with the experience of PSs. PSs are often reported to be indistinguishable from objective shocks, evoking alarm, frustration, and confusion, forcing the individual to face the uncertainties of what to them is a novel and confusing experience. (PACE 2013; 36:673–683)

    This study has not been funded by any source. AB was supported by an Ontario Graduate Scholarship. Dr. Kovacs has received speaking engagement honoraria from Medtronic Inc. Dr. Katz is supported by the Canadian Institutes of Health Research Canada Research Chair in Health Psychology. This paper is derived, in part, from the first author’s Master’s thesis.

    Changes in glacier surface cover on Baltoro glacier, Karakoram, north Pakistan, 2001–2012

    The presence of supraglacial debris on glaciers in the Himalaya-Karakoram affects the ablation rate of these glaciers and their response to climatic change. To understand how supraglacial debris distribution and associated surface features vary spatially and temporally, geomorphological mapping was undertaken on Baltoro Glacier, Karakoram, for three time-separated images between 2001 and 2012. Debris is supplied to the glacier system through frequent but small landslides at the glacier margin that form lateral and medial moraines and less frequent but higher volume rockfall events which are more lobate and often discontinuous in form. Debris on the glacier surface is identified as a series of distinct lithological units which merge downglacier of the convergence area between the Godwin-Austen and Baltoro South tributary glaciers. Debris distribution varies as a result of complex interaction between tributary glaciers and the main glacier tongue, complicated further by surge events on some tributary glaciers. Glacier flow dynamics mainly control the evolution of a supraglacial debris layer. Identifying such spatial variability in debris rock type and temporal variability in debris distribution has implications for glacier ablation rate, affecting glacier surface energy balance. Accordingly, spatial and temporal variation in supraglacial debris should be considered when determining mass balance for these glaciers through time.

    Targeted inactivation of integrin-linked kinase in hair follicle stem cells reveals an important modulatory role in skin repair after injury

    Integrin-linked kinase (ILK) is key for normal epidermal morphogenesis, but little is known about its role in hair follicle stem cells and epidermal regeneration. Hair follicle stem cells are important contributors to newly formed epidermis following injury. We inactivated the Ilk gene in the keratin 15-expressing stem cell population of the mouse hair follicle bulge. Loss of ILK expression in these cells resulted in impaired cutaneous wound healing, with substantially decreased wound closure rates. ILK-deficient stem cells produced very few descendants that moved toward the epidermal surface and into the advancing epithelium that covers the wound. Furthermore, those few mutant cells that homed in the regenerated epidermis exhibited a reduced residence time. Paradoxically, ILK-deficient bulge stem cells responded to anagen growth signals and contributed to newly regenerated hair follicles during this phase of hair follicle growth. Thus ILK plays an important modulatory role in the normal contribution of hair follicle stem cell progeny to the regenerating epidermis following injury. © 2011 Nakrieko et al.

    Temporal variations in supraglacial debris distribution on Baltoro Glacier, Karakoram between 2001 and 2012

    Distribution of supraglacial debris in a glacier system varies spatially and temporally due to differing rates of debris input, transport and deposition. Supraglacial debris distribution governs the thickness of a supraglacial debris layer, an important control on the amount of ablation that occurs under such a debris layer. Characterising supraglacial debris layer thickness on a glacier is therefore key to calculating ablation across a glacier surface. The spatial pattern of debris thickness on Baltoro Glacier has previously been calculated for one discrete point in time (2004) using satellite thermal data and an empirically based relationship between supraglacial debris layer thickness and debris surface temperature identified in the field. Here, the same empirically based relationship was applied to two further datasets (2001, 2012) to calculate debris layer thickness across Baltoro Glacier for three discrete points over an 11-year period (2001, 2004, 2012). Surface velocity and sediment flux were also calculated, as well as debris thickness change between periods. Using these outputs, alongside geomorphological maps of Baltoro Glacier produced for 2001, 2004 and 2012, spatiotemporal changes in debris distribution for a sub-decadal timescale were investigated. Sediment flux remained constant throughout the 11-year period. The greatest changes in debris thickness occurred along medial moraines, the locations of mass movement deposition and areas of interaction between tributary glaciers and the main glacier tongue. The study confirms the occurrence of spatiotemporal changes in supraglacial debris layer thickness on sub-decadal timescales, independent of variation in surface velocity. Instead, variation in rates of debris distribution is primarily attributed to frequency and magnitude of mass movement events over decadal timescales, with climate, regional uplift and erosion rates expected to control debris inputs over centurial to millennial timescales. Inclusion of such spatiotemporal variations in debris thickness in distributed surface energy balance models would increase the accuracy of calculated ablation, leading to a more accurate simulation of glacier mass balance through time, and greater precision in quantification of the response of debris-covered glaciers to climatic change.

    Evolving antimicrobial resistance in a patient receiving palliative OPAT for a vascular graft infection: a case report

    Prosthetic vascular graft infection is devastating and frequently fatal. Cure requires removal of the graft and reperfusion by placement of a new graft. However, no evidence-based guidelines exist for management where removal of the graft is not possible. We describe a patient who lived in a state of chronic infection suppression through outpatient parenteral antimicrobial therapy (OPAT) over a period of 32 months, and outline the challenges experienced and strategies used to suppress infection in the face of escalating antimicrobial resistance. To date there have been very few reports of OPAT used in the palliative context, and this case illustrates the microbiological issues that can arise and the importance of the full OPAT multi-disciplinary team in managing these issues and optimising the patient's quality and length of life.